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ABSTRACT 

The segrosome of multiresistance plasmid TP228 
comprises ParF, which is a member of the ParA 
ATPase superfamily, and the ParG ribbon-helix- 
helix factor that assemble jointly on the parH 
centromere. Here we demonstrate that the distinct- 
ive parH site (~100-bp) consists of an array of de- 
generate tetramer boxes interspersed by AT-rich 
spacers. Although numerous consecutive AT-steps 
are suggestive of inherent curvature, parH lacks 
an intrinsic bend. Sequential deletion of parH tetra- 
mers progressively reduced centromere function. 
Nevertheless, the variant subsites could be 
rearranged in different geometries that 
accommodated centromere activity effectively re- 
vealing that the site is highly elastic in vivo. ParG 
cooperatively coated parH: proper centromere 
binding necessitated the protein's N-terminal 
flexible tails which modulate the centromere 
binding affinity of ParG. Interaction of the ParG 
ribbon-helix-helix domain with major groove bases 
in the tetramer boxes likely provides direct readout 
of the centromere. In contrast, the AT-rich spacers 
may be implicated in indirect readout that mediates 
cooperativity between ParG dimers assembled on 
adjacent boxes. ParF alone does not bind parH but 
instead loads into the segrosome interactively with 
ParG, thereby subtly altering centromere conform- 
ation. Assembly of ParF into the complex requires 
the N-terminal flexible tails in ParG that are con- 
tacted by ParF. 



INTRODUCTION 

The transmission of genetic information from generation- 
to-generation is a fundamental biological process that 
must take place with high fidelity. The molecular events 
that underpin accurate genome segregation in eucaryotes 
are comparatively well-described (1). In contrast, under- 
standing of the mechanism of procaryotic DNA segrega- 
tion is more rudimentary. However, the compact genetic 
modules that mediate the precise partitioning of plasmids 
are highly informative systems in which to unravel this 
process in precise detail (2). 

Four distinct classes of plasmid segregation cassette 
have been defined (3). The two most well-studied types 
each comprise a pair of autoregulated genes and a 
nearby centromere analogue. The first gene specifies an 
ATPase that either possesses Walker box motifs (ParA) 
or is an actin homologue (ParM), whereas the accom- 
panying gene encodes a centromere binding factor 
(CBF) (4-8). The CBF is a site-specific DNA binding 
protein that loads onto the centromere to produce a nu- 
cleoprotein complex of defined geometry (9-12). The 
ATPase does not directly contact the centromere, but 
instead interacts with the CBF to assemble the mature 
segrosome. In the case of ParM, ATP-induced 
filamentation from the segrosome propels each member 
of a plasmid pair in opposite directions to achieve segre- 
gation (13). ParA homologues also polymerize in response 
to ATP binding, a process that is influenced by the CBF 
and/or by DNA (14-21). The ParA filaments emanating 
from the segrosome may drive plasmids towards the 
cell poles, or retraction of the ParA polymers may draw 
plasmids in opposite directions away from the 
cytokinetic zone (4,15). Recent in vivo studies favour the 
latter (22,23). 
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The tri-partite segrosome of multiresistance plasmid 
TP228 comprises the ParA homologue, ParF, and the 
ParG CBF which assemble on the parH centromere 
(Figure 2). ATP binding promotes the polymerization of 
ParF into dynamic, extensive multistranded filaments that 
are implicated in segregation (15). The dimeric ParG 
protein comprises C-terminal regions that interlock into 
a ribbon-helix-helix (RHH) fold linked to a pair of 
flexible N-terminal extensions (24). The folded region 
harbours the major determinants for dimerization, for 
binding to the parH centromere and to the 0 F operator 
site, as well as for ParF interaction (18,24-26). The ParG 
mobile tails are also multifunctional. First, arginine 
fingers stabilize the transition state during nucleotide hy- 
drolysis by their partner proteins. The ParG N-terminal 
tail includes an arginine finger-like motif that stimulates 
ATP hydrolysis by ParF (18). This stimulation may be a 
crucial aspect of the cycle of ParF polymerization and 
depolymerization during segregation. Second, ParF poly- 
merization is stimulated by the ParG flexible tail (18). The 
tails either may reorganize or stabilize ParF filaments by 
tethering ParF monomers within a single protofilament or 
aligned protofilaments. Alternatively, ParG might cluster 
at points of polymer growth or disassembly (15,18). In this 
sense, ParG may play a role similar to formins and related 
factors that influence the elongation and disassembly of 
actin filaments in eucaryotes, or may be analogous to 
microtubule-associated proteins that modulate tubulin 
dynamics (19). Third, ParG binds to the O f operator 
during transcriptional repression of the parFG genes 
(25). O f comprises eight degenerate 5'-ACTC-3' boxes 
arranged in a combination of direct and inverted orienta- 
tion (26). Each tetramer motif recruits one ParG dimer, 
implying that the fully bound operator is cooperatively 
coated by up to eight dimers. The 0 F operator apparently 
has evolved with subsites that bind ParG dissimilarly to 
produce a nucleoprotein complex fine-tuned for optimal 
interaction with the transcription machinery (26). A tran- 
sient (3-strand element in the ParG mobile tail associates 
with the protein's folded RHH domain thereby further 
modulating the binding of the protein to the operator 
(25). The mechanism by which this interaction between 
flexible and folded domains affects DNA binding is 
elusive. 

CBFs have heterogeneous primary sequences that cor- 
relate with the diversity in plasmid centromere organiza- 
tion (27). The precise loading of each CBF onto its 
cognate centromere is a vital early step that is crucial for 
correct segrosome assembly and the subsequent cascade of 
events during partitioning. Here, the interaction of ParG 
with the distinctive parH centromere is dissected: parH is a 
complex multisubsite locus that nevertheless can accom- 
modate a variety of synthetic subsite re-arrangements for 
accurate segregation. Both direct and indirect readout of 
parH potentially are required for correct coating of the 
centromere by ParG emphasizing that an intricate set of 
interactions mediate the loading of the protein onto the 
site. The centromere binding specificity of ParG is 
enhanced by the protein's flexible N-terminal tails which 
also are necessary for recruitment of ParF to the mature 
segrosome. 



MATERIALS AND METHODS 

Strains, plasmids and molecular biology procedures 

Plasmids were propagated and analysed using Escherichia 
coli DH5a (28). Strain BL21 (Novagen) was employed for 
protein overproduction and plasmid partition assays were 
performed in the polA strain BR825 (29). Recombinant 
plasmids for overexpression of the parF and parG genes 
were described previously (30). ParG derivatives with 9, 19 
or 30 amino acid deletions of the N-terminal tail were 
produced from plasmids constructed elsewhere (25). The 
partition probe vector pFH450 is a derivative of the 
bi-replicon plasmid pALA136 (31,32). Plasmid pFH547 
comprises the parFGH region cloned in pFH450 (33). 
Plasmid pMW20 was constructed in two steps. First, a 
promoter-less parFG cassette was amplified from 
pFH547, digested with Sacl-Xbal, and inserted between 
the same sites in the arabinose-inducible expression vector 
pBAD30 (34) to generate plasmid pMW19. The 
arabinose-inducible parFG cassette then was amplified 
from pMW19, digested with Xhol, and inserted in the 
same site in pFH450 to produce pMW20. Derivatives of 
parH possessing a full complement of 5'-ACTC-3' boxes, 
but with one or more rearrangements (Figure 1), were 
constructed by inserting double-stranded oligonucleotides 
carrying the appropriate sequences and with EcoRV-Nsil 
compatible ends between the same sites in pMW20. 
Derivatives of parH bearing deletions of 5'-ACTC-3' 
boxes were constructed first by amplifying the appropriate 
regions from pFH547, cleaving the PCR products with 
BamHI-XhoI, and inserting between the same sites in 
pFH450. The arabinose-inducible parFG cassette from 
pMW19 was then inserted as an Xhol fragment in the 
same orientation in each case. The nucleotide sequences 
of the inserts in all plasmid constructs were verified. DNA 
cloning and other molecular biology procedures followed 
standard protocols. 

Plasmid segregation assays 

Segregation assays were performed using pFH450 or 
pM W20 derivatives that replicate at low copy number in 
strain BR825 as detailed elsewhere (33). Briefly, the 
relevant plasmid-bearing strains were grown for ~25 gen- 
erations without chloramphenicol selective pressure. 
Plasmid retention was then determined by replica plating 
colonies to agar medium with and without the antibiotic. 
The values presented are the means of at least three inde- 
pendent tests with typical standard deviations (SDs) of 
-10%. 

Protein production and purification 

The hexahistidine-tagged ParF and ParG proteins were 
overproduced and purified by Ni 2+ affinity chromatog- 
raphy as described previously (30). 

Gel retardation assays 

DNA fragments for gel retardation assays were PCR 
products amplified from appropriate plasmid templates 
using one primer bearing a 5' biotin label and a second 
unlabelled primer, or were generated by annealing 



5084 Nucleic Acids Research, 2011, Vol. 39, No. 12 



BAD 




parH F Retention (%) RCA 

0 F ^^>— c^H>^H<^b 52 0.88 
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Figure 1. Deletion and mutational analysis of the parH centromere. The segregation probe vector pMW20 is illustrated at the top. This plasmid 
replicates at medium copy number via the pMBl ori. Replication switches to a low copy number via the PI replicon in a polA mutant, but the 
plasmid is segregationally unstable in this background. The parFG genes are expressed from an arabinose-inducible promoter (Pbad) instead of from 
their native regulatory sequences. The distribution and orientation of the variant 5'-ACTC-3' motifs in the parH-0 F region are indicated by open 
arrowheads. The segments cloned in pMW20 and tested for segregation activity are shown. Tetramer boxes that were inverted from their normal 
orientation are denoted with filled arrowheads. The 5- and 11-bp insertions in the parH +5 and parH +n sites are indicated by filled bars. Segregational 
stability assays were conducted as outlined in the text. The relative centromere activities (RCA) associated with the sites are also shown compared to 
the activity conferred by the intact parH-0 F region. The pMW20 vector without parH-0 F had an RCA of 0.31. The values presented are the means 
of at least three independent tests with typical standard deviations of ~10%. 



complementary primers one of which was 5' biotinylated. 
Purification of the fragments and conditions for retard- 
ation assays were outlined in detail previously (26). 
Briefly, biotinylated DNA (2nM) was incubated for 
20min at 25°C in binding buffer [10 mM Tris-HCl, pH 
7.5, 50 mM KC1, 1 mM dithiothreitol, 5mM MgCl 2 , 
0.05mg/ml poly(dl-dC)] with the ParG concentrations 
shown in figure legends. Reaction mixtures were 
electrophoresed on 10% polyacrylamide gels in 0.5 x 
Tris-borate-EDTA (TBE) buffer for 30-90 min at 80 V 
at 22°C. DNA was transferred by capillary action to posi- 
tively charged nylon membranes (Roche), and the 
transferred DNA fragments were immobilized by UV 
crosslinking. The biotin end-labelled DNA was detected 
using the LightShift chemiluminescent EMSA kit 
(Pierce) (30). 

DNase I footprinting 

The preparation and purification of biotinylated PCR 
products, conditions for DNase I footprinting reactions 
with ParG, denaturing gel electrophoresis, and detection 



of biotinylated DNA followed procedures described in 
detail recently (26). 

Atomic force microscopy and data analysis 

For sample preparation in atomic force microscopy 
(AFM), freshly cleaved mica was functionalized with 
poly-L-lysine (PL) to support secure immobilization of 
DNA (35). The mica disc was incubated with 30 ul of a 
10 ug/ml aqueous PL solution for 30 s, subsequently 
washed with 4 ml of Millipore water and dried under a 
nitrogen stream. A DNA fragment encompassing the 
parH region was amplified using pFH547 as template. 
Protein-DNA binding reactions (20 ul) included the 
purified PCR fragment (2.5 nM) and ParG (1-3 uM) in 
binding buffer (10 mM Tris-HCl, 50 mM KC1, 1 mM 
EDTA, pH 7.5). After 15-30min, the mixture was 
diluted 30-fold in binding buffer and 30 ul of this 
dilution were immediately placed on the PL-mica. After 
60 s of incubation, the mica was rinsed carefully with 2 ml 
of Millipore water and allowed to dry under a gentle 
nitrogen stream. Measurements were performed with an 
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AFM instrument as described (36) and FESP tips (Veeco) 
in tapping mode. Fields of 1 x 1 urn were scanned at line 
rates of 1-2 Hz and resolution of 512 x 512 pixels. AFM 
images were plane corrected with SPIP software (Image 
Metrology, Denmark) and saved in bitmap format for 
further analysis with ImageJ software (version 1.41o, 
NIH, USA). The images were scaled to 2048 x2048 
pixels using bilinear pixel interpolation. The entire 
contours of the DNA molecules were then traced using 
the freehand line option and saved as A'j-coordinates. 
The traced contours were also marked in the images in 
order to pinpoint two xj-coordinates that define the 
region along each contour which was occupied by ParG. 
To this end, the points at which the height begins to 
increase relative to the free DNA were identified using 
the height threshold tool and their .\j-coordinates were 
defined using the point selection tool. To compare the 
regions occupied by protein with the putative binding 
sites in the DNA fragments, the saved xj-coordinates 
were characterized by their distances from the DNA 
terminus which is nearest the protein complex. These dis- 
tances, as well as total DNA lengths, were determined by 
summing the distances between the successive coordinates 
of the entire contours using Excel. 

Bending analysis 

Plasmid DNA carrying parH, O f and the 5'-end of parF 
was digested with various restriction endonucleases 
producing different fragments for curvature analysis. 
DNA was mixed with loading dye and analysed on 
equilibrated native polyacrylamide gels as outlined previ- 
ously (37). Gels were pre-run for ~3h until current and 
temperature remained constant. Electrophoresis was 
carried out in 1 x TBE at 150 V (8 mA) for ~4 h (migration 
distance of bromphenol blue ~14cm) at 4 or 23°C. Gels 
were stained for 30 min in an aqueous solution of ethidium 
bromide (lmg/1), followed by rinsing in water prior to 
documentation. The 1 kb Plus DNA ladder (Invitrogen) 
and the 1 kb DNA ladder (New England Biolabs) served 
as marker fragments together with plasmid fragments re- 
sulting from restriction digests running in the same lane as 
the parH fragment. Migration of all fragments was 
determined for each gel and calibration curves were 
plotted using the marker fragments (logarithm of 
number of base pairs versus distance migrated). The 
apparent sizes in the acrylamide gel relative to the calibra- 
tion curve were determined. 

RESULTS 

Defining and dissecting the parH centromere in vivo 

The O f operator upstream of parFG comprises eight 
variant 5'-ACTC-3' motifs arranged in a combination of 
direct and inverted orientation (Figure 1). The tetramer 
boxes are separated regularly by 4-bp AT-rich sequences. 
A single ParG dimer loads onto each 5'-ACTC-3' box, 
suggesting that the operator is cooperatively coated by 
as many as eight dimers during transcriptional repression 
of parFG (26). Inspection of the region further upstream 
of O f revealed a second cluster of 12 degenerate 



5'-ACTC-3' motifs (parH), also separated by AT-rich 
spacers (Figure 1). One of the boxes is inverted 
compared to the remainder. These 12 5'-ACTC-3' boxes 
are embedded in a set of longer repeats that were noted 
previously in this region and which were originally 
proposed as the putative centromere locus at which 
segrosome assembly occurs (4,30). 

As repeat motifs are characteristic of plasmid centro- 
meres (4), attempts were made to support the partitioning 
of a segregationally unstable test plasmid harbouring the 
full complement of 20 5'-ACTC-3' boxes, i.e. parH-Op, 
when the ParFG proteins were provided in trans from a 
compatible plasmid. The proteins were produced either 
from genes under the control of the native parFG expres- 
sion signals or from a lactose-inducible promoter. 
Expression of parFG from a variably inducible arabinose 
promoter was also tested. In addition, selected synthetic 
promoters with strengths from weak to high (38) were 
trialled. None of these approaches elicited improved seg- 
regation of the vector possessing the complete set of 
tetramer boxes compared to the same plasmid lacking 
the repeat sequences. As an alternative strategy, the 
parFG genes were inserted in the segregation probe 
vector, pFH450 (31), under the control of an 
arabinose-inducible promoter (Pbad; 34). This manipula- 
tion produced plasmid pMW20 that entirely lacks any of 
the natural regulatory sequences upstream of parFG 
(Figure 1). With arabinose induction, the plasmid was 
maintained at a frequency of 18 ± 3% during non- 
selective growth for approximately 25 generations in an 
E. coli polA mutant in which the plasmid replicates using 
the low copy number PI replicon. However, insertion of 
the parH-Op region 5' of the arabinose promoter 
improved retention to 59 ± 16%, a value close to that 
observed with the intact partition cassette (33). Similar 
retention values were obtained when the parH-0 F region 
was cloned elsewhere downstream of parFG in pMW20 
(data not shown). Thus, the parH-0 F region exhibits 
ectopic, centromere-like activity when located in cis to 
parFG. One advantage in expressing parFG from Pbad is 
that any contribution of O f to centromere activity can be 
examined independently of its regulatory functions. The 
conditions required for in trans activity of the ParFG 
proteins at parH-0 F have yet to be defined: the appropri- 
ate intracellular protein concentrations and/or the 
temporal pattern of parFG expression that is required to 
support centromere activity in trans may be difficult to 
replicate artificially. Alternatively, a DNA topological 
requirement or a positional effect of the genes and/or 
centromere may influence segregation activity. These 
possibilities require further investigation. 

As parH and 0 F both comprise arrays of 5'-ACTC-3' 
boxes, the independent efficacies of the two regions in 
centromere function were tested in pMW20. The parH 
region displayed centromere activity that was indistin- 
guishable from that of the complete parH-Op region 
(Figure 1). The operator locus alone also was an effective 
centromere, albeit with slightly less activity than parH: O f 
may have dual roles in transcriptional repression of parFG 
(26) and in centromere function. As the activity of parH 
was not enhanced appreciably by O f , the centromeric 
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properties of parH alone were characterized further. 
Progressive deletion of pairs of 5'-ACTC-3' boxes from 
parH was accompanied by concomitant gradual reduc- 
tions in the retention levels of the test plasmids 
(Figure 1). Notably, plasmids bearing only four (parH g 
12 ) or six (parH 1 _\2) tetramer boxes were maintained at 
approximately half the frequency conferred by full-length 
parH. 

The parH and O f sites display comparable centromere 
activities, but the arrangements of the 5'-ACTC-3' motifs 
in the two loci differ markedly: parH comprises eleven 
direct repeats and one inverted repeat (Figure 1), 
whereas O f consists of three direct repeats interspersed 
with five inverted boxes (26). To assess further the malle- 
ability of these sequences for partition activity, synthetic 
parH centromeres with reconfigured 5'-ACTC-3' boxes 
were cloned in pMW20 and tested in segregation assays 
(Figure 1). A site in which all of the tetramer boxes were 
oriented similarly was a proficient centromere (parH Dm ). 
Inversion of the six rightward motifs or the five leftmost 
boxes of parH caused only modest decreases in segrega- 
tion activity (parH Kim and parH Um , respectively). A site 
in which alternating tetramers in parH were inverted from 
their canonical orientation was a less effective centromere, 
although remained partially functional (/?a;-// ALT ). 
Insertion of one-half helical turn at the centre of parH 
only modestly affected centromere activity (parH +5 ), 
whereas insertion of a complete turn {parH +u ) reduced 
centromere function more severely. Thus, a variety of 
natural and artificial 5'-ACTC-3' box dispositions are 
viable for parH centromere action, although a minimum 
of eight repeats is required for efficient activity. Moreover, 
altering the relative helical positions of the two halves of 



parH was well-tolerated whereas maintaining these pos- 
itions, but increasing the distance between the halves, 
was more deleterious. 

Discrete binding of ParG to parH and O f in vitro 

The ParG protein binds to the Of operator to achieve 
transcriptional repression of the parFG genes (25,26). 
DNase I footprinting in vitro of the parH centromere 
region revealed that ParG also protects the entire set of 
12 5'-ACTC-3' boxes from digestion on both DNA 
strands (Figure 2). The AT-rich spacers separating 
certain boxes were slightly less well protected, most obvi- 
ously on the top strand. The DNA fragments used in these 
reactions harboured both parH and O f so as to ascertain 
whether ParG interacted continuously or discontinuously 
with the two loci: the zones of protection were separated 
by a ~5-bp window that remained fully accessible to 
DNase I (Figure 2). These data correlate with recent ob- 
servations that delimited the extent of ParG interaction 
with O f (26). 

As a second strategy to probe the interaction of ParG 
with parH, a fragment possessing the centromere (365 bp) 
was bound by ParG and visualized by AFM (Figure 3A). 
The images showed protein binding as discrete extended 
foci. The measured total DNA length was 117 ± 5nm 
which was somewhat shorter (~5%) than the 124 nm 
expected for B-form DNA of this length. Shortening to 
this extent has been observed previously by AFM of 
naked DNA (35,39), so the reduced parH fragment 
lengths observed here do not necessarily reflect 
ParG-induced compaction. Therefore, the approximate 
DNA helical rise was 0.32nm/bp for these surface prep- 
arations. To examine if the visible ParG binding 
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Figure 2. DNase I footprinting of ParG at the parH centromere and adjacent O f operator. The distribution and orientation of the variant 
5'-ACTC-3' motifs in the parH-0 F region are indicated by open arrowheads. The bent arrow indicates the putative parFG promoter (25,26). 
Footprinting reactions were performed as outlined in the 'Materials and Methods' section using PCR fragments biotinylated at the 5'-ends of 
either top or bottom strands. ParG concentrations (uM monomer, left to right): 0, 0.05, 0.1, 0.2, 0.6, 1.0 and 1.5. A + G. Maxam-Gilbert sequencing 
reactions. The relative dispositions on the top and bottom strands of the parH region that are protected from DNase I digestion are shown in the 
bottom panel. The 5'-ACTC-3' motifs are boxed. 
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102 ±17 bp 



182 ±22 bp 

Figure 3. AFM of ParG-DNA complexes. (A) Representative AFM images (500 x 500 nm) are shown of ParG bound to a PCR fragment that 
contained parH. Regions with bound ParG are visible as foci of increased height (white regions). (B) Histograms of distances between P end and P] 
and P 2 in DNA fragments that contain parH after correction for the estimated tip induced lengthening of protein complexes. A traced DNA contour 
was made whose consecutive vx-coordinates were used to calculate total DNA lengths and positions of ParG binding sites where P e „d is the DNA 
extremity that is located nearest the protein focus, and P| and P 2 are the coordinates that define the initial and end points, respectively, of the protein 
bound region. Data were fitted with a Gaussian function (curves) to calculate means and SDs. The numbers of base pairs are calculated based on the 
measured helical rise of 0.32nm/bp. Squares indicate the distances for the initial point, Pi, and circles mark the distances for the endpoint, P 2 . 
Summaries of the AFM analysis are shown at the bottom. The blue bar denotes the position of parH based on DNase I footprinting (Figure 2). 



corresponded to the parH region, the coordinates P! and 
P 2 , defining the foci along digitized traces of the DNA 
contours, were located as described in the 'Materials and 
Methods' section. The DNA terminus that was closer to 
the protein focus was determined and the positions of Pj 
and P 2 were specified as distances from this terminus. The 
distance distributions shown in Figure 3B comprised a 
correction of the tip induced lengthening of the detected 
protein region. For an approximation of this effect, it was 
assumed that the measured DNA width of ~10nm cor- 
responded to a broadening of 8 nm relative to the known 
DNA diameter of 2 nm. Therefore, the points determined 
initially were shifted by 4nm towards the centres of the 
foci. The distance maximum in the fragment containing 
parH (Figure 3B) was found at 102 ± 17 bp for position P] 
and 182 ± 22 bp for P 2 . These positions roughly define the 
ends of the parH site which is ~100-bp based on DNase I 
footprinting (Figure 2). In conclusion, the AFM measure- 
ments described here confirm the selective binding of 
ParG to parH within the probed fragment at the specified 
protein concentrations. Although the direction of the frag- 
ments is not defined in the AFM images, the observation 
of single protein foci together with the agreement between 
the measured distances and the parH location established 
previously (Figure 2) indicate that ParG binding occurred 
exclusively in this region. 

The parH centromere lacks intrinsic curvature 

The plasmid Rl parC centromere displays strong intrinsic 
curvature and is further distorted into a U-shaped struc- 
ture by binding of its cognate CBF to the two arms of the 



site (11,37). In contrast, the cenE centromere of plasmid 
pGENT shows modest inherent curvature which is not 
altered significantly by the binding of its CBF on a 
linear DNA fragment (40). A theoretical analysis of the 
inherent curvature of the parS centromere of plasmid 
PI suggests that the site is curved (C. Hoischen and 
S. Diekmann, unpublished data). Moreover, DNA 
bending by integration host factor (IHF) at the centre of 
parS permits the cognate CBF to span the two arms of the 
site (41,42). These different topological features may be 
instrumental in organizing functional segrosomes in dif- 
ferent systems. 

The sequence of a Hindlll-EcoRI fragment encompass- 
ing parH, O f and the 5'-end of parF (Figure 4) is AT rich 
(61.2%) and contains blocks of AT nucleotides. These 
mostly comprise four or five consecutive AT steps suggest- 
ing that this region might be curved and show anomalous 
gel migration. Inherent DNA curvature has clearly iden- 
tifiable properties. First, curvature can be measured as 
retarded migration in polyacrylamide gel electrophoresis. 
Second, curvature is most evident at moderate tempera- 
tures and decreases at elevated temperatures (43^15). The 
quotient apparent sequence length divided by known 
sequence length is termed the ^-factor with an experimen- 
tal error in /c-factor measurement of ±0.03. Variation of 
curvature values in normal DNA sequences is between 
^-factor values 0.98 and 1.05. Applying this experimental 
strategy, we recently detected DNA curvature with large 
A>factors for yeast centromeres [^-values up to 1.20; (46)] 
and the plasmid Rl parC centromere [k > 2.0; (37)]. The 
parH locus and flanking regions were inserted in pUC18 
and a set of overlapping fragments were generated by 
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restriction enzyme digestion (Figure 4). Fragment migra- 
tion relative to marker fragments in different lanes, as well 
as to plasmid fragments in the same lane, was analysed in 
6% polyacrylamide gels under different temperature con- 
ditions, and in agarose gels. Apparent fragment sizes were 
determined relative to marker fragments with normal mi- 
gration. Most of the test fragments from the region sur- 
rounding parH did not show any appreciable gel 
retardation under standard conditions (23° C, TBE 
buffer) (Figure 4). Only rather weak retardation of 
a 520-bp Hindlll-EcoRI fragment (A-factor = 1.08), a 
267-bp Agel-MunI fragment (A-factor = 1.06), and a 
219-bp Munl-EcoRI fragment (A-factor = 1.06) was 
measured. The A-factors of all tested fragments slightly 
increased when the temperature was reduced to 4°C. 
However, a weak moderation in the A-factor was evident 
for the Agel-MunI fragment at the lower temperature. 
Although this analysis suggests some secondary structure, 
perhaps DNA curvature, resulting in a slightly anomalous 
gel migration, it reveals that the DNA sequence that 
includes parH-0 F is not substantially curved when tested 
in a linear fragment. This conclusion was supported by the 
analysis of permuted fragments of identical size and 
sequence context produced from cloning the parH-Op 
region in a bending vector: these fragments did not 
display intrinsic curvature in gel electrophoresis (data 
not shown). 

The clustered 5 -ACTC-3' motifs in parH provide a high 
affinity docking site for ParG 

To parallel the in vivo deletion analysis of parH (Figure 1), 
DNA fragments with decreasing numbers of 5'-ACTC-3' 



boxes (Figure 1) were tested for ParG binding in vitro. 
Full-length parH was assembled into a single complex at 
the lowest ParG concentration that was examined in gel 
retardation assays (Figure 5A). In contrast, ParG incom- 
pletely bound a substrate lacking two 5'-ACTC-3' boxes 
even at the highest protein concentration (parH 3 _i 2 ). 
Multiple nucleoprotein complexes with intermediate mi- 
grations were also detectable, suggestive of partial occu- 
pancy of the motifs by ParG. Target fragments possessing 
only eight, six or four tetramer boxes each were 
bound progressively less avidly by the protein (parH^ij, 
parHj-u and parH 9 _ l2 , respectively), and a single pair of 
5'-ACTC-3' motifs was sufficient only for very weak and 
limited binding (parH u _i 2 ) (Figure 5A). DNase I foot- 
printing of fragments possessing different numbers of 
motifs confirmed a positive correlation between ParG 
binding and increasing motif number (Figure 5B). 
Footprinting also demonstrated that ParG maintained 
specificity in protecting the regions harbouring the 
5'-ACTC-3' boxes from digestion even when presented 
with as few as four boxes. 

The variable tetramer motifs in parH comprise either 
canonical 5'-ACTC-3' boxes or derivatives of this 
sequence (Figure 2). A single 5'-ACTC-3' tetramer is in- 
sufficient for ParG binding in vitro (26). Thus, to assess 
whether ParG recognizes the assorted tetramer motifs 
with different avidities, the centromere was subdivided ex- 
perimentally into pairs of motifs flanked by their natural 
AT-rich spacers {parH\_ z to parH u _\ 2 ) (Figure 6A). 
Double-stranded oligonucleotides bearing these sequences 
were tested in gel retardation assays with titrations of 
ParG. The subsites displayed distinct binding patterns 
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Figure 5. ParG binding to parH subsites possessing decreasing numbers of 5'-ACTC-3' boxes. (A) Gel retardation analysis of ParG with 
the full-length, biotinylated parH centromere containing 12 5'-ACTC-3' motifs, or with fragments possessing 10 {parH 3 _ n ), 8 (parH 5 _ i2 ), 
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with ParG: parH\- 2 and parH 5 _ 6 were recognized most 
strongly, whereas ParG bound the parH 9 _ X0 fragment 
least well (Figure 6A). Other motif pairs showed inter- 
mediate binding properties. 

To investigate further the differences in ParG affinities 
for different parH subsites, ParG binding to the 
biotinylated parHi_ 2 oligonucleotide was challenged with 
increasing concentrations of each of the unlabelled 
subsites (Figure 6B). The parH x _ 2 and parH 5 _ 6 fragments 
competed most efficiently, whereas parH 7 __ s and parH g _ w 
were weak competitors. Thus, gel retardation assays and 
competition data revealed that ParG recognizes different 



centromere subsites with different affinities (parH\_ 2 ~ 
parH 5 _ 6 > parH X \_\ 2 x parH 7 _ s « parH 3 _ 4 > parHc,_ l0 ). 
These differences likely arise both from dissimilarities in 
the tetramer box sequences as well as from variations in 
the intervening AT-rich spacers in parH, both of which 
contribute significantly to DNA binding by ParG (26). 

The influence of the mobile N-terminus of ParG on 
centromere interaction 

The flexible N-terminal tail of ParG is multifunctional, 
harbouring an arginine finger-like motif that stimulates 
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Figure 6. ParG binding to adjacent pairs of 5'-ACTC-3' boxes from the parH centromere. (A) Gel retardation assays of ParG with fragments 
possessing pairs of tetramer boxes from parH. The fragments were generated by second-strand synthesis of 49-nt single-strand oligonucleotides 
bearing the relevant boxes at the centres of the fragments. The single-stranded oligonucleotides each included a common priming site for 
second-strand synthesis by a 5'-biotinylated primer. Thus, the fragments contained the relevant boxes and immediate flanking regions, but no 
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unbound DNA and ParG-DNA complexes, respectively. (B) Competition gel retardation assays of ParG binding to the parH l _ 2 fragment. Binding 
reactions containing ParG (4uM), a biotinylated fragment (2.5 nM) harbouring the parHi_ 2 pair of 5'-ACTC-3' boxes that are strongly bound by the 
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Reactions were analysed further as described in the 'Materials and Methods' section. Unlabelled competitor DNAs contained the same sequences 
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nucleotide hydrolyis by the partner ParF ATPase, as well 
as possessing sequences that promote ParF polymeriza- 
tion (18). Moreover, the interaction of ParG with the O f 
operator is modulated by transient associations be- 
tween the flexible N-terminal and folded C-terminal 
domains in complex with the target DNA (25). To 
examine whether the tail also influences interaction with 
the parH centromere, derivatives lacking 9, 19 or 30 amino 
acids from the ParG N-terminus (25) were tested 
for centromere assembly (Figure 7). This progressive 
truncation of the mobile tail was accompanied by 



increasingly stronger interactions of the ParG deletion 
derivatives with the centromere both in gel retard- 
ation assays and DNase I footprinting. Notably, at 
protein concentrations at which ParG only weakly 
shifted parH into a series of intermediate nucleoprotein 
complexes in retardation assays, A30ParG assembled the 
DNA into a major retarded complex (Figure 7A). In 
addition, A30ParG fully protected the parH site 
from DNase I digestion at protein concentrations that 
were insufficient for protection by the full-length protein 
(Figure 7B). 
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The preceding observation that deletion of the flexible 
tail increases parH binding by ParG apparently contra- 
dicts previous findings that A30ParG interacts more 
weakly at the O f locus in both gel retardation and 
surface plasmon resonance assays (25). However, the Op- 
site subsequently was found to be more extensive than 
originally described, comprising eight variant 
5'-ACTC-3' boxes instead of the five motifs used in 
previous experiments (26). The observation that 
A30ParG recognized a substrate containing three 
tetramer boxes less avidly than did the full-length 
protein (25) is also intriguing as it suggests that the 



flexible tail affects DNA binding differently dependent 
on the number of 5'-ACTC-3' boxes present in the target 
site. Therefore, the interaction of the ParG deletion de- 
rivatives with pairs of 5'-ACTC-3' motifs flanked by their 
natural AT-rich spacers {parH x _ 2 to parH n _x 2 ) 
(Figure 6A) was examined (Figure 8A). Unlike the 
stronger binding that the truncated derivatives displayed 
to the complete parH locus (Figure 7), binding of 
full-length and deletion versions of ParG was broadly 
similar to the parH x _ 2 , parH 3 _ 4 , parH s _ b and parH 7 _ s frag- 
ments. However, A19ParG generally interacted more 
weakly with the subsites than either full-length ParG, 
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A9ParG or A30ParG. Strikingly, the parH 9 _ l0 and 
parH\x_ n oligonucleotides were also bound less well by 
A30ParG than by the wild-type protein. Thus, complete 
truncation of the N-terminal tail elicited improved binding 
of ParG to the full-length centromere, but caused weaker 
binding when certain subsites of parH that contain fewer 
5'-ACTC-3' boxes were tested. 

As the variant entirely lacking the mobile tail binds 
parH at a lower protein concentration than does 



full-length ParG, relative centromere binding by ParG 
and A30ParG was assessed by challenging a preformed 
P&vG.parH complex with increasing concentrations of 
A30ParG (Figure 8B, left). The complex formed by 
ParG with parH migrates more slowly in gel retardation 
assays than does the A30ParG:;w// complex allowing 
ready discrimination between the two species. At a 1:1 
ratio, the Pa.rG.parH complex was entirely disassembled 
by A30ParG and only nucleoprotein complexes with 
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intermediate migration were observed. This likely reflects 
the formation of complexes that include both protein 
species: either ParG and A30ParG may bind simultan- 
eously to the centromere, or monomer exchange between 
the two species may produce ParG:A30ParG 
heterodimers. Even at a 20:1 ratio of A30ParG:ParG, 
only nucleoprotein species with intermediate mobility 
were evident indicating that the deletion protein cannot 
fully displace ParG from the centromere. The converse 
experiment was also performed. When a preformed 
A30P&rG:parH complex was challenged with the full- 
length protein, mixed complexes with migrations inter- 
mediate between those produced with ParG or A30ParG 
only were again evident (Figure 8B, right). However, at a 
high ParG:A30ParG ratio, parH was assembled entirely 
into a complex that comigrated with the complex observed 
only with ParG. Thus, ParG can actively dislodge 
A30ParG from the centromere more effectively than the 
reverse. 

In summary, truncation of the ParG flexible tail permits 
binding to the full-length parH centromere at lower 
protein concentrations than with the native protein 
(Figure 7). In contrast, binding of the deletion proteins 
to subsites possessing only two 5'-ACTC-3' motifs is 
only as strong, and in some instances weaker, than 
binding by full-length ParG (Figure 8A). Moreover, 
ParG not only competes better than A30ParG for the 
parH site, but can displace the deletion protein from the 
centromere more effectively than A30ParG can dislodge 
ParG (Figure 8B and C). The combined results suggest 
that the mobile tail modulates the association rate of 
ParG for the centromere, but that the stability of the 
P&xG-parH complex that lacks the tail may be perturbed. 

Like all DNA binding proteins, ParG binds to DNA 
non-specifically in vitro when the protein concentration 
is sufficiently high (24). To ascertain whether the ParG 
mobile tail contributes to its binding specificity, full-length 
ParG and A30ParG proteins were tested at concentrations 
up to 5 (iM with a biotinylated 60-bp fragment bearing the 
Epstein-Barr virus nuclear antigen binding site. This 
fragment lacks the characteristic 5'-ACTC-3' motifs 
recognized by ParG and is not bound by the protein at 
<3.5 (iM, a concentration that is sufficient for full binding 
of parH (Figure 8C, left). The non-specific DNA is shifted 
into a series of complexes with progressively slower migra- 
tion at elevated ParG concentrations. In contrast, the 
non-specific DNA is fully bound in the presence of 
A30ParG at a concentration of 3 uM (Figure 8C, right). 
Equivalent results were obtained with a second fragment 
of non-specific DNA (data not shown). Thus, removal of 
the flexible N-terminal tail from ParG also causes the 
protein to bind non-specific DNA more avidly. 

Loading of ParF into the segrosome 

The influence of ParF on the binding of ParG to the 
region upstream of parFG was examined previously, 
both in the absence and presence of ATP (30). However, 
this analysis was performed before the boundaries of the 
parH and O f loci were delineated accurately [Figure 1; 
(26)], and before the dynamic effects of ATP on ParF 



behaviour had been investigated (15,18). In view of this, 
the assembly of ParF into the segrosome was re-assessed 
using parH centromeric DNA defined here. First, the ParF 
protein does not bind the parH centromere in gel retard- 
ation assays either in the absence (30) or presence of ATP 
(data not shown). Second, the ParG.parH complex was 
supershifted progressively into a more slowly migrating 
complex that remained in the gel loading well as ParF 
was titrated into the reactions. At a ParF:ParG ratio of 
32:1, the centromere was supershifted entirely (Figure 9A, 
left). Third, ParF slightly modified the DNase I footprint- 
ing patterns that ParG produced at parH (Figure 9B). 
ParG was included in footprinting reactions at three con- 
centrations: 0.1 (low), 0.25 (medium) and 0.5 (high) uM. 
In each case, ParF was omitted or was included at 4 or 
20 uM. At low ParG concentration, no protection of the 
centromere was observable on top or bottom DNA 
strands either in the absence or presence of ParF. 
Incomplete protection of the site was apparent at 
medium ParG concentration. ParF modestly enhanced 
this protection. At high ParG concentration, ParG fully 
protected both the centromere and O f operator from 
DNase I digestion. The two loci were separated by the 
distinctive window of DNase I accessibility noted earlier 
(Figure 2). However, DNase I access to this zone was 
hindered by the presence of ParF which also slightly 
ameliorated protection at the other boundary of parH. 
The alterations in DNase I protection patterns induced 
by ParF were subtle, but reproducible, and demonstrate 
that centromeric DNA assembled into the segrosome may 
be organized differently than when bound only by ParG. 

ParG binds progressively more weakly to parH subsites 
that comprise decreasing numbers of 5'-ACTC-3' boxes 
(Figure 5A). The capacity of ParF to co-assemble with 
ParG into these subsites was tested. Supershifted 
complexes were weakly detectable with as few as six 
tetramer motifs derived from the centromere {parH 1 _i 2 ). 
In contrast, supershifted species were not evident with 
four (parH g _i 2 ) or two (parH u _i 2 ) 5'-ACTC-3' boxes 
(data not shown). Indeed, the ParG-only complexes dis- 
appeared at the highest ParF concentrations used in these 
experiments indicating that the complexes are sufficiently 
unstable that ParF can strip weakly bound ParG from the 
DNA via protein-protein interactions (30). 

The N-terminal tail of ParG is required for segrosome 
assembly 

The ParFGH segrosome was detectable in gel retardation 
assays when increasing concentrations of ParF were 
incubated with ParG and a parH fragment (Figure 9A, 
left). To examine whether segrosome assembly 
necessitated the N-terminal mobile tails of ParG, 
deletion versions of the protein were employed in retard- 
ation assays with ParF and the parH site. A biotinylated 
fragment (25 nM) bearing the parH locus was incubated 
with 0.5 uM of A9ParG, A19ParG or A30ParG (Figure 
7A) and titrated with ParF (0.2-20 uM). Segrosome for- 
mation was not detectable when A9ParG (Figure 9A, 
right), A19ParG or A30ParG were present. 
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Figure 9. Assembly of ParF into the segrosome. (A) Supershifting of the ParG.parH complex in the presence of ParF. A 320-bp biotinylated PCR 
product containing parH was incubated simultaneously with ParG (0.5 uM, left) or A9ParG (0.5 uM, right) and increasing concentrations of ParF 
and analysed by gel retardation assays. ParF concentrations (uM monomer, left to right): 0.2, 0.5, 1.0, 2.0, 4.0, 8.0, 16 and 20. Open and filled arrows 
indicate unbound DNA and protein-DNA complexes, respectively. The band above free parH in the left panel is likely to be the same fragment with 
an atypical secondary structure that is commonly observed in DNA preparations of the centromere and O f locus (26,30). (B) Influence of ParF on 
the DNase I protection pattern by ParG at the parH centromere. Footprinting reactions were performed as described in the 'Materials and Methods' 
section using a 244-bp PCR fragment encompassing the parH-O r region. Reactions contained ParG at low (L; 0.1 uM), medium (M; 0.2 uM) or high 
(H; 0.5 uM) concentrations, and ParF at 0, 4 or 20 uM (left-right in each case). Reactions were performed on both top and bottom DNA strands. 
The major zones of ParG protection are indicated by the shaded rectangles, and the regions of altered pattern observed in the presence of ParF are 
denoted by the adjoining hatched boxes. 



Thus, removal of the flexible tail abolishes segrosome 
assembly in gel retardation assays. 



DISCUSSION 

Plasmid centromeres are diverse. They typically comprise 
tandem repeats whose numbers, lengths, sequences, orien- 
tations and spacing are highly variable from 
replicon-to-replicon (4,5). Although characteristically pos- 
itioned either upstream or downstream of the correspond- 
ing segregation genes, atypical centromeres that are 
situated more remotely have been described (47,48). 
Moreover, there are examples in which more than one 
plasmid locus functions as an independent centromere, 
or in which sites may act synergistically (49,50), suggesting 
that cross-talk between segrosomes assembled at discrete 



positions may contribute to partitioning in certain cases. 
Among the most well-studied examples, the ParB protein 
uses separate domains to associate with two different 
types of motif in the parS centromere of the PI plasmid, 
as well as to bridge sites located on different molecules. 
ParB binding is facilitated by the DNA bending protein 
IHF, producing nucleoprotein complexes with distinctive 
topologies (9,51). In the case of the Rl plasmid, the inter- 
action of the cognate CBF with the parC centromere gen- 
erates ring-like, superhelical complexes (10-12). These 
characteristic protein-DNA superstructures are 
recognized in turn by the polymerizing ATPase (52) to 
form the mature segrosome. The parH centromere of 
plasmid TP228 was proposed to comprise a short set of 
~20-bp iterated sequences based on bioinformatics 
analysis (30). However, in light of recent work that 
defined the core binding site of the ParG protein as a 
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5'-ACTC-3' box flanked by AT-rich sequences (26), we 
have reappraised the organization of the parH locus here 
(Figure 1). The region 5' of the parFG genes harbours 20 
degenerate 5'-ACTC-3' motifs separated by AT-rich 
spacers. The complete array of tetramer boxes, a subset 
of eight motifs that comprise the O f operator (26), and the 
12 distal boxes (parH) each were proficient centromeres, 
although the independent activity of O f was modestly 
weaker than that of parH alone. Remarkably, synthetic 
parH sites in which multiple tetramer boxes were 
inverted relative to their normal orientation also were ef- 
fective centromeres, albeit none were as competent as the 
wild-type locus. This observation parallels previous 
findings that inversion of tetramer boxes did not 
diminish ParG binding at the O f locus (26). ParG 
dimers may be able to load onto the native tetramer 
boxes in the artificial parH sites and subsequently tether 
additional dimers non-specifically to neighbouring, 
mutated boxes through protein-protein interactions. 
This may produce a complex that mimics the wild-type 
configuration sufficiently that the ParF factor can still be 
recruited to assemble a functional segrosome. Thus, 
segregation may require only that a sufficient number of 
ParG dimers assemble on the centromere, permitting 
ParF to interact effectively with ParG displayed on a var- 
iety of promiscuous tetramer box arrangements. The 
absence of intrinsic curvature in parH fits with this 
concept as it reveals that the centromere is not locked 
into a single topology. Accordingly, insertion of a 
half-helical turn at the centre of the site was more 
well-tolerated than insertion of a full turn indicating that 
distance constraints between the halves of the site may be 
more crucial than their spatial positioning relative to 
helical phase. Thus, the site may be a tractable element 
that potentially can be flexed, if required, when complexed 
with ParG and ParF in the segrosome. Analogously, the 
ParB protein of the PI plasmid can interact with multiple 
binding motifs in the parS centromere to generate a 
variety of nucleoprotein configurations (51). How the 
proposed plasticity of the ParG-parH interaction inte- 
grates with loading of ParF and the subsequent segrega- 
tion process that requires directional plasmid movement 
remains to be resolved. 

The antiparallel (3-strands within RHH factors such as 
ParG comprise their principal DNA binding determin- 
ants, directing the protein dimers to the major groove 
(53). However, the binding of ParG at parH also is 
influenced profoundly by the N-terminal flexible tail 
(~30 amino acids): derivatives with increasing truncations 
of the ParG tail shifted the centromere progessively more 
efficiently in gel retardation assays, as well as more readily 
protecting the full-length site from DNase I digestion 
(Figure 7). Nevertheless, subsites of parH are bound 
equally well by ParG and its deletion derivatives, or are 
bound less proficiently by the latter in certain instances 
(Figure 8A). Moreover, a version of ParG that entirely 
lacks the mobile tail competes less well for centromeric 
DNA when challenged with the full-length protein. 
Previous chemical shift mapping of ParG-0 F interactions 
highlighted the improved discrimination between the 
operator and non-specific DNA that the ParG mobile 



tail confers (25). Correspondingly, deletion of the tail in- 
creases non-specific DNA binding by ParG (Figure 8C). 
Bases in the 5'-ACTC-3' boxes in parH are likely con- 
tacted by residues in the anti-parallel (3-strands of ParG, 
whereas the mobile tail may provide phosphate backbone 
contacts that enhance the strength of the interaction. In 
addition, a transient (3-strand previously detected in the 
mobile tail (25) may be implicated in enhancing the speci- 
ficity of the interaction with centromeric DNA by a mech- 
anism that has yet to be fully revealed. It is noteworthy 
that other RHH factors similarly possess disordered ex- 
tensions that modulate their DNA binding properties 
(54-58) indicating that this may be a widespread feature 
among this class of proteins. The weaker interaction of 
ParG deletion derivatives with certain parH subsites also 
may reflect compromised dimer-dimer interactions in 
these shorter substrates compared to the full-length 
sequence on which an array of dimers can cooperatively 
assemble. 

Direct readout of sequences by DNA binding proteins 
involves interaction with specific bases in the binding site. 
In contrast, indirect sequence readout entails modulation 
of protein-DNA complex formation by bases that are not 
specifically contacted by the protein. These non-contacted 
bases may contribute, for example, to distortion of the 
DNA thereby promoting stable nucleoprotein complex 
formation by assisting in the precise alignment of amino 
acid residues and bases contacted during direct readout 
(59). We speculate that the 5'-ACTC-3' boxes in parH 
are read out directly by ParG and that the intervening 
AT-rich spacers participate indirectly and that these 
combined contributions explain the dissimilar binding 
patterns of ParG to different parH subsites. For 
example, the parH x _ 2 and parH-i_ % subsites both possess 
identical pairs of 5'-ACTC-3' boxes, but different spacer 
and flanking sequences whose indirect read out by ParG 
elicits different affinities for the subsites in gel retardation 
assays (Figure 6). Although the parH site does not exhibit 
global curvature (Figure 4), the AT-rich flanking regions 
may display localized flexibility that promotes the 
association of ParG dimers assembled on adjacent 
tetramer boxes. Accordingly, increasing the AT content 
of the spacer in a natural binding site ameliorates the 
interaction with ParG, whereas this interaction is 
disimproved by other mutations (26). Thus, direct and 
indirect readout, combined with cross-talk between 
protein dimers loaded onto neighbouring tetramer 
boxes, may cooperatively promote coating of the parH 
site with an array of interacting ParG dimers. The 
flexible tails of ParG provide another layer of complexity 
to the centromere interaction: the tails modulate the DNA 
binding affinity of ParG. This modulation may be via a 
direct interaction with the the parH site, or by cooperative 
ParG dimer-dimer interactions mediated through the 
N-terminal tails. Elucidation of the tertiary structures of 
ParG complexed with the centromere will clarify further 
the nucleoprotein interactions that underpin segrosome 
assembly. 

Like ParG, the ParR and co proteins encoded by 
plasmids Rl/pSK41 and pSM19035, respectively, are 
RHH factors that bind their cognate centromeres. 
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However, the sequences of the ribbon motifs that are 
involved in DNA recognition differ between the various 
proteins, and the DNA repeats that the proteins contact 
also are dissimilar (10,12,21,26,52). ParR assembles on 
DNA as a dimer-of-dimers, each dimer binding to one 
half-site of a 20-bp repeat that occurs four times in the 
pSK41 parC centromere (10). The co protein recognizes 
multiple sets of contiguous heptad repeats that are 
arranged in different numbers and combinations of 
direct and indirect orientation on pSM19035: each set of 
repeats displays centromere-like properties (50,60). Like 
the ParR-parC and ParG-parH interactions, co coopera- 
tively binds a pair of contiguous DNA motifs as a 
dimer-of-dimers. Strikingly, both ParR and co assemble 
into nucleoprotein superstructures that enwrap the centro- 
meric DNA and which may be required for recruitment of 
the polymerizing NTPase (10,12). Further analysis is 
needed to determine whether the extensive repeats that 
comprise the parH site (Figures 1 and 2) also act as a 
scaffold for assembly of a nucleoprotein superstructure 
onto which ParF docks. 

The ParF component alone does not interact directly 
with the parH site either in the presence or absence of 
ATP. Instead the ATPase assembles into the segrosome 
via interactions with ParG to generate the mature 
complex. Both the RHH and flexible domains of ParF 
contact ParG, the mobile regions being required to stimu- 
late ATP hydrolysis by ParF as well as promoting ParF 
polymerization (18). The unstructured tails are also essen- 
tial for recruitment of ParF to the segrosome (Figure 9A, 
right). Assembly of the intact complex with full-length 
ParG was detectable as a higher order, supershifted 
species that failed to enter the polyacrylamide matrix in 
gel retardation assays (Figure 9A, left). DNase I footprint- 
ing patterns at the edges of the parH site were altered 
modestly by the addition of ParF into reactions that con- 
tained ParG, suggestive of perturbations of the centro- 
mere boundaries when the tripartite complex is 
assembled. These perturbations may parallel observations 
with other segrosomes in which the centromere can be 
extensively distorted or wrapped about the CBF in 
higher order nucleoprotein superstructures (10-12). A 
plethora of questions remain unanswered about the func- 
tioning of centromeric DNA in tripartite segregation 
complexes that also involve a CBF and a ParA-type 
ATPase. What is the evolutionary force that drives the 
extensive sequence and organization diversity among 
plasmid centromeres? Does this diversity result in 
profound variations in the architecture of segrosomes 
assembled on different episomes, or are there unifying 
themes in the organization of these complexes? An evolu- 
tionarily conserved segregation complex seems to exist in 
eucaryotes, although it is modulated between different 
organisms. Does the same pertain among procaryotic 
segrosome structures? How are homologous ParA 
proteins captured by apparently dissimilar CBF-centro- 
mere complexes? The solutions to these and other conun- 
drums will illuminate further how centromere sequences 
underpin the segregation process. 
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